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Abstract 

Quantum compiling addresses the problem of approximating an arbitrary quantum gate with a string of gates 
drawn from a particular finite set. It has been shown that this is possible for almost all choices of base sets and 
furthermore that the number of gates required for precision e is only polynomial in log 1/e. Here we prove that using 
certain sets of base gates quantum compiling requires a string length that is linear in log 1/e, a result which matches 
the lower bound from counting volume up to constant factor 



1 Introduction 

Quantum computation generalizes computer science to utilize novel quantum physical resources as elementary build- 
ing blocks for information processing |jl|, ^ ^ Q]. Quantum algorithms, like their classical analogues, can be written 
in a number of nearly equivalent ways. While a classical program is typically composed of a series of simple boolean 
functions, such as NAND and FANOUT, a quantum algorithm is typically written as a product of unitary gates, such as 
the Hadamard transform H, the controlled-NOT (CNOT), and the 7r/8-gate T [^. For classical computers, a common 
problem is that of compiling a program, in which one typically wishes to express the program in as few elementary 
operations as possible. By analogy, we can raise the principal questions of quantum compiling: which sets of gates can 
be composed to form what sorts of quantum algorithm, how many of them are necessary, and what efficient algorithms 
can be devised to express quantum programs in terms of a particular set of base gates? 

Mathematically, a gate on n quantum bits (qubits) is represented by a unitary transformation on a 2" -dimensional 
vector space. We will denote the set of all determinant-one unitary transformations of a d-dimensional vector space 
by SU{d). This space is a manifold and is hence parameterized by a continuum of real parameters; for example, the 
2x2 unitary transforms 

parameterized by a, f3, 9 represent the group SU (2) of valid single qubit gates. 

In contrast, digital quantum algorithms compute with only a finite set of base gates (such as those mentioned 
previously: H, T, and CNOT). This is a reasonable restriction in real circuit implementations, since the presence 
of noise reduces the number of reliably distinguishable gates to a finite subset of the continuous set. Finite gate 
sets are also intrinsic to fault-tolerant quantum computation, the art of constructing arbitrarily reliable circuits from 
unreliable parts. ^, ^, ^] Thus, in general we do not desire perfect computational universality, but only the ability to 
approximate any quantum algorithm, preferably without using too many more gates than originally required. 

A set of base gates A C SU (d) is computationally universal if given any gate U, we can find a string consisting of 
gates from A and their inverses, such that the product of the gates in the string approximates U to arbitrary precision. 
Equivalently, A must generate a dense subgroup of SU (d). 
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Which sets of base gates are computationally universal? It turns out that probabilistically speaking, almost all 
of them are [ p^ [lljl. If base gates are chosen at random, then all but a set of measure zero are computationally 
universal. The idea is that if the eigenvalues of the base gates have phases that are irrationally related to tt (which 
occurs with probability one), then taking powers of them allows each base gate to approximate a one-parameter 
subgroup to arbitrary precision, just as integer multiples of a random vector modulo a lattice will almost always fill 
space. Furthermore, the base gates will almost always lie on different one-parameter subgroups, which will generate 
all of SU{d) with probability one. 

Given that compiling is generically possible, it is vital to determine how short a string of base gates is typically 
required to approximate a given gate to a specified precision; this is the question we consider in this paper. The con- 
struction described by Lloyd requires using a number of base gates exponential in log \Je to achieve a precision of 
e. This is an unreasonable cost for many applications. However, Solovay [ |l2| ] and Kitaev [|l3|| have independently de- 
scribed an efficient (meaning its running time is polynomial in log 1/e) algorithm for quantum compiling that produces 
strings of length only 0(log'^(l/ e)), where c is a constant between 3 and 4. [Q The algorithm works by constructing 
successively finer e-nets; finite sets of gates that can approximate any element of SU{d) to an accuracy of e. 

On the other hand, as we will later discuss, since a ball of radius e in SU{d) has volume proportional to e"^"^^, it 
takes 0((l/e)'*'^^) different strings of gates to approximate every element of SU (d) to a precision of e. Therefore, no 
algorithm will ever be able to reduce c below 1 . Furthermore, it is unlikely that the successive approximation method 
used by the Solovay-Kitaev theorem will be able to do better than c = 2.[[l^ This still leaves open the question of 
whether some other technique could establish an upper bound asymptotically smaller than the one achieved by the 
Solovay-Kitaev theorem. 

Here, our main result is that for at least some univeral sets of base gates only 0(log 1/e) gates are sufficient to 
approximate any gate to a precision e (i.e. c = 1). This is within a constant factor of the lower bound obtained from 
counting arguments. We say that these base gates are not only computationally universal, but also efficiently universal, 
since using them for quantum compiling requires a string length that is optimal up to a constant multiplicative factor. 

We present this result as follows. The set of strings from a fixed computationally universal set of base gates cover 
SU{d) increasingly densely and uniformly, as the string length grows, [jl^ First, in Section^ we quantify how quickly 
this occurs by introducing a framework for comparing the distribution of strings with the uniform distribution. We use 
this formalism in Section ^ to identify a condition on base sets that implies their efficient universaUty. In Section ^ 
we then combine this condition with results from the literature to show that efficiently universal gate sets for Hilbert 
spaces of any finite dimension. Section || discusses lower bounds for compilation and demonstrates the optimality of 
the result; we conclude with open questions and further directions. 



2 Preliminaries 

We begin by developing a metric of how well strings drawn from a finite set of gates approximate arbitrary elements 

of SU{d). 

Let dg be the Haar measure on SU{d) normalized so that J dg — 1. Consider the Hilbert space L'^{SU{d)) with 
norm defined by the usual inner product {'ip,ip) = J V'(5)*</'(5)'i5- The norm of a linear transformation on i^(5t/((i)) 
is given by 

\M\ = sup {\\Mf\\\ f eL\SU{d)),\\f\\^l} . (2) 

When M is bounded and hermitian, the norm is simply the supremum of its spectrum and as a result, |Af"| — | Af |". 
Define a representation U i-^ U of SU {d) on L'^{SU{d)) by 

ilfix)^ fiU^^x). (3) 

Using the right invariance of the Haar measure, we see that U is unitary. For any finite set A C SU (d), define the 
mixing operator T{A) by 
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All such T are hermitian and have norm one. We will often simply write T instead ofT{A). These represent averaging 
the action of the elements of A and their inverses on a function; when the function is a probability distribution on 
SU{d) we can think of T as multiplying by a random element of A. 

Applying T" represents averaging over the action of words of length n. Denote the set of words of length n made 
up of elements of A and their inverses by Wn{A), or when the set A is understood, simply Wn- This set comprises 
(2|^|)" words, though as matrices there are generally some duplicates since substrings such as AA^^ — 1 for all 
A E A. For any positive integer n, expanding T" gives 

Y ^ (5) 

(2\A\)" ■ 

We want to compare T" to the integral operator P. 

Pf{h) - / f{gh)dg = / f{g)dg . (6) 



Note that P is the projection operator onto the set of constant functions on SU{d), and hence P = P^ and P^ = P. 
It is not hard to show that TP = P = PT and consequently 

(T - PY = T''' - P . (7) 

The metric for comparing T{A) to P is given by 

A(^) = \T{A) - P\ . (8) 

From Eq. ^ and the hermiticity of T and P, it follows that 

A(yi)" = |r"(yi)-p|. (9) 

If one thinks of T" as a Riemann sum then A serves as to quantify how quickly T" converges to the integral. It has 



been shown [15| that if is a computationally universal set that all the eigenvalues of T — P have absolute value 
strictly less than one. However, this only implies that A(^) < 1, since T — P has an infinite number of eigenvalues. 

The proof of main result of our paper — that efficiently universal sets of gates exist — is divided in the next two 
sections. In Section || we show that A(^) < 1 implies that A is efficiently universal and in Section || we demonstrate 
that for any d an efficiently universal set of gates can be found in SU (d). 

3 A condition for efficient universality 

Theorem 1 For any A C SU{d) such that h-{A) < 1, ^ is efficiently universal. Specifically, there exists a constant 
C such that for all U G SU{d), e > 0, andn > Clogl/e, there is a w € Wn{A) such that \w — U\ < e. 

Before proving the theorem, we will need to note a fact about the geometry of SU{d). For any d and rg, if V{r) is 
the Haar measure of a ball of radius r in SU{d), then there exist constants fci and k2 such that 

kir'^^-^ < V{r) < k2r'^^'^ . (10) 

for all r G (0, tq). This is true because SU (d) is a d^ — 1-dimensional manifold and because V{r) does not depend 
on the center of the ball under the Haar measure. 

Now we can proceed with the proof of Theorem |l]: 
Proof Define x e L^{SU{d)) by 

\ otherwise " ^ ' 

Let V — \\Px\\ — 11x11^ be the measure of the ball around the identity of radius e/2. We will not perform this 
integration, but recall from Eq. |o|that V > fci(e/2)''^"^ 
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Let T = T{A) and A = K{A). 

First we use the Cauchy-Schwartz inequality to give 



;x, (T" - P)c/x; < 11x11 II 
< MWt'^ - P)u\ < A"y. 

Another way to compute the same inner product is 

[XAT'' - P)Ux) = (x,T''Ux) -V\ 



(12) 



(13) 



Combining Eq. [l| and Eq. [l| gives that (x, T"C7x) - 



< A" V^. This means that there exists C which depends 



only on A such that if n > C log 1/e then A" < V and (x, T'^Ux) > 0. Specifically, it suffices to choose 

">bi(l7A)l°s(l/^)+ log(l/A) ■ 

xiwU^^g) 



When this occurs it means that 



/ X{9) E 



weWn 



-dg>0, 



(14) 



(15) 



which implies that 3g E SU{d) and w G Wn such that xig) 7^ and x{wU ^g) ^ 0. Thus \g — /| < e/2 and 
\wU~^g — /| < e/2, implying that \w — g~^U\ < e/2. Combining these and using the triangle inequality gives 
\w-U\<e.U 



4 A class of efficiently universal gate sets 

In this section we show that for each d there exists a set of gates Qd in SU{d) such that A{Qd) < 1 (and thus Qd is 
efficiently universal). We begin with a result demonstrating this for SU (2) and then extend it to SU (d). 

Lemma 2 (Lubotsky, Phillips and Sarnak) Let 

andV, ^ , \A. (16) 



Then A = A{{Vi,V2, V3}) = ^ < L Furthermore, for any [/i, C/2, U3 G SU{2), A{{Ui, [/a, U3}) > A 



The proof of this Lemma is presented in [g_6[ |17|]. Let Q2 — {Vi, V2, V3}, as it is a family of quantum gates from 
SU{2) for which A is strictly less than one. The optimality of A for this set is an interesting aside, but has little bearing 
on what follows. 

Extending the result to SU{d) will require slightly more effort. To this end, if Ik denotes the k x k identity matrix, 
then, for any U G SU{2) and 2 < j < d, define (U) to be 



Ij-2 



Id-j 
We will typically omit the '^'^^ where it is understood. 



(3l'^\u) =1 U I e SU{d). (17) 
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Lemma 3 (Diaconis and Shahshahani) Let {G^}, l<i<j<dbea series of (2) independent random matrices in 
SU{2) that are chosen uniformly according to a Haar measure. Then 



d-l d 

n n p^^G]) 



(18) 



is uniformly distributed in SU{d). 



This Lemma is proved in [|18p. It means that if we had access to random elements of SU (2) that were completely 
uniformly distributed, then we could generate uniformly distributed elements of SU{d). When the elements of SU{2) 
are only approximately uniform, we can bound the distance to uniformity of the words they form by using what is 
known as a hybrid argument: 



Lemma 4 (Bernstein and Vazirani) IfUi, 
and \Ui — Vi\<S, then \Um ■ ■ ■ U2U1 — ¥„, 



, Urn, Vi, . . . , Vm are linear operators such that \Ui\ < 1, \Vi\ < 1 
■V2Vi\ < mS. 



Proof If we replace a single Ui in the product Um ■ ■ - Ui with the corresponding Vi, then the entire product will still 
change by less than 6 since \ AB\ < • |_B| for any operators A, B. Thus we can construct a series of m + 1 "hybrid" 
operators, which start with Ui - ■ ■ Um, end with Vi • • • Vm and are each separated by less than 5. The proof follows 
from the triangle inequality. ■ 

We now combine all of the other results in this section to demonstrate a set of gates in SU{d) for which A is strictly 
less than one. 

Proposition 5 For any d > 2, define Qd by 

Qd = iPjiV) 1 1 < J < (d - 1), y e ^2} . (19) 

Then AiGd) < 1. 
Proof 

The approach of our proof will be to approximate the uniform distribution in Lemma |^, and then we show that this 
forces A to be less than one. To this end, let Rm C VF^^-d-j (Gd) be the set of all products of the form 



d~l d 

n n i^m) 



(20) 



such that the are selected from WmiG2)- 

From Lemma | we have that Vm, \T(yi,V2, V3) - P\' 



A™ for some A < 1. There are („) terms in Eq. nSl 



each of which is approximated to within an accuracy of A"* by the appropriate length m substring of Rm- Thus, using 
the hybrid argument and Lemma || gives that 



Now, if we let R' denote W„ 



Rm then 



E 



w-P 



\w\ 



\R. 



< 



\Rr. 



(21) 



< 



1 - 



\Rn 



\w\ 



\w\ 

\Rm 



\w\ 



k{Rm) 

-A(i?„) 



= 1 - 



\Rm 

W\ 



■(1-A(i?„)) 



(22) 



If we choose m large enough so that (o)^™ < 1> then this last expression will be less than one, and K{Qd) < 1. 



Thus, efficient quantum compiling is possible for d-dimensional systems, given the appropriate choice of base gate 



set. 
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5 Lower Bounds 



This proves that sets of base gates exist which can achieve a precision of e in 0(log 1/e) gates, but can we do any 
better? An e-ball in SU{d) has measure of order e"^ so if we expect to cover all of SU{d) with strings of length n, 
then we will require {2\A\)"'k2€'^ ~^ > 1, or equivalently. 

Thus the result is optimal up to a constant factor This fact is quite general, since it follows from simple counting 
arguments. However, if the assumptions of the problem are relaxed to allow many gates to act in parallel, then using 
ancilla qubits it is possible to approximate single-qubit gates with a circuit of size poly(log 1/e) but depth of only 
poly(loglog l/e).[?] This construction, like the one in this paper, relies on having access to a specific set of base gates; 
to date, only the Solovay-Kitaev theorem applies to any computationally universal set. 

In our original problem, though, eliminating the constant linear factor turns out to be impossible. Consider any 
set A of I base gates that is not computationally universal. Let B{A, S) be the set of gates obtained by perturbing 
each gate in A by no more than 6. Then B{A, S) has non-zero measure (in SU{dy), almost all of its elements are 
computationally universal and from the hybrid argument, any string of length n drawn from gates in B{A, S) will be 
within n5 of something in the (non-dense) group generated by A. Since we can make 5 arbitrarily small, any fixed 
prefactor in front of log 1/e will fail on a computationally universal set of non-zero measure for some values of e. 

Note that unlike most results about quantum compiling, this argument also holds if the base gates are parameter- 
ized; say, Ai, . . . ,Ai are elements of the algebra su{d) and a single operation now has the form e^^'*, for any t > 0. 
The above proof demonstrates that there exist sets with non-zero measure which require arbitrarily many steps, even if 
the steps are continuous. If we measure cost not in terms of number of steps, but by the total time taken, then we have 
to modify the argument slightly. For small values of t, \e^^* — I\ is on the order of tS, but for large t the difference 
never gets any higher than 6. This means that no matter how many steps we take, in time t, we will stay within t5 of 
some non-dense subgroup and the same result holds. 

These results can be obtained more simply by considering the (non-zero measure) set of gates which are very close 
to the identity. If every gate does very little, then we will need a large number them in order to accomplish anything. 
The reason why universal sets that are very close to non-universal sets are interesting is because of their frequent 
appearance in actual physical systems, such as NMR under the weak coupling approximation. 



6 Conclusions 

We have found a condition that implies the efficient universality of a set of gates and demonstrated a family of gate sets 
in SU{d) that satisfy this condition. This means that given access to such a gate set, arbitrary quantum gates can be 
approximated to accuracy e using only 0(log 1/e) gates. Such knowledge will likely be invaluable in crafting future 
physical implementations of quantum information processing systems. 

Many open questions remain, however. For example, determining or bounding A (even numerically) for a given 
set of base gates seems to be very difficult, though it is likely an important step in determining the prefactor C, which 
measures how effective a set of gates would be for compiling. The method used by JI^ p^ ] involves specialized 
arguments from number theory that do not generalize easily to other sets of gates or to SU{d) for d > 2. Our proof 
(like the Solovay-Kitaev algorithm) also requires the ability to perform the inverse of each gate in the base set. This 
restriction feels unnecessary, yet very little is known in the case where inverses are unavailable. 

More broadly, it is also generally unknown which gate sets are efficiently universal and when A < 1. Note that 
A(^) < 1 implies that A is efficiently universal, but the converse is not known to be true. Thus it is possible that the 
questions of efficient universality and A being less than one will be settled separately. 

However, if A(^) were to be a continuous function of A (for fixed |^|), then the situation would simplify con- 
siderably. In this case, it is not hard to show that A(^) < 1 if and only if A is computationally universal, so that 
computational universality, efficient universality and A < 1 would all become equivalent conditions. We suspect that 
this is the case, but have been unable to prove it. 

Finally, the techniques used in our results do not suggest any efficient (i.e. running time polynomial in log 1/e) 
algorithms for quantum compiling. The most important, and possibly most difficult, open problem remaining is to find 
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a polynomial time algorithm to approximate any unitary gate by a fixed efficiently universal set of base gates with a 
string whose length saturates the 0(log 1/e) bound. 

We thank Persi Diaconis, Michael Freedman, Neil Gershenfeld, David Jerison, Seth Lloyd and Michael Nielsen 
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earlier version of this paper and sharing with us a draft of [?]. AWH acknowledges support from the Army Research 
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